Size of websites in incubator.apache.org

by Tony Stevenson-2

Good day,

As part of rolling out the new backup server for the infra team, I have
discovered that several podling sites are extremely large.

Namely:

119M    /x1/www/incubator.apache.org/activemq
324M    /x1/www/incubator.apache.org/cxf
102M    /x1/www/incubator.apache.org/directory
166M    /x1/www/incubator.apache.org/lucene.net
587M    /x1/www/incubator.apache.org/openjpa
299M    /x1/www/incubator.apache.org/servicemix
166M    /x1/www/incubator.apache.org/uima


I am singling out all sites that are over 100 MB in size here.  Can someone
please check the contents of these directories?  I appreciate that some
of them have graduated from the incubator and as such, these datasets
are either redundant or should be archived.
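
For anyone who wants to reproduce a listing like the one above, here is a minimal sketch in Python. It is not the command that produced the original numbers (that looks like du output); the function names and the threshold parameter are made up for illustration. It sums apparent file sizes, so its totals can differ from du, which counts disk blocks.

```python
import os

def dir_size_bytes(path):
    """Sum the sizes of all regular files under path (symlinks are skipped)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if not os.path.islink(full):
                total += os.path.getsize(full)
    return total

def large_subdirs(site_root, threshold_bytes=100 * 1024 * 1024):
    """Return (size, path) pairs for immediate subdirectories of site_root
    whose contents exceed threshold_bytes, largest first."""
    results = []
    for entry in os.scandir(site_root):
        if entry.is_dir(follow_symlinks=False):
            size = dir_size_bytes(entry.path)
            if size > threshold_bytes:
                results.append((size, entry.path))
    return sorted(results, reverse=True)
```

Calling large_subdirs("/x1/www/incubator.apache.org") on the web host would give the candidates for cleanup, assuming the account running it can read every podling directory.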

I would appreciate a definitive directive as to what should be done with
these directories.

I will also be updating the documentation on how to handle
graduation/removal from the incubator.  I'll send an update once this
has been done too.


Cheers,
Tony




---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@...
For additional commands, e-mail: general-help@...


Re: Size of websites in incubator.apache.org

by robert burrell donkin-2

On Tue, Apr 22, 2008 at 5:22 PM, Tony Stevenson <tony@...> wrote:
> Good day,
>
>  As part of rolling out the new backup server for the infra team, I have
> discovered that several podling sites are extremely large.
>
>  Namely:
>
>  119M    /x1/www/incubator.apache.org/activemq

graduated -> activemq.apache.org

>  324M    /x1/www/incubator.apache.org/cxf

IIRC graduating -> cxf.apache.org

>  102M    /x1/www/incubator.apache.org/directory

graduated -> directory.apache.org

>  166M    /x1/www/incubator.apache.org/lucene.net
>  587M    /x1/www/incubator.apache.org/openjpa

graduated -> openjpa.apache.org

>  299M    /x1/www/incubator.apache.org/servicemix

graduated -> servicemix.apache.org

>  166M    /x1/www/incubator.apache.org/uima

still here :-)

>  I am singling out all sites that are over 100 MB in size here.  Can someone
> please check the contents of these directories?  I appreciate that some of
> them have graduated from the incubator and as such, these datasets are
> either redundant or should be archived.
>
>  I would appreciate a definitive directive as to what should be done with
> these directories.

IMHO graduated websites should be deleted, but it's probably polite to
inform the PMCs first

>  I will also be updating the documentation on how to handle
> graduation/removal from the incubator.  I'll send an update once this has
> been done too.

great

- robert



Re: Size of websites in incubator.apache.org

by Marshall Schor

Tony Stevenson wrote:
> Good day,
>
> As part of rolling out the new backup server for the infra team, I
> have discovered that several podling sites are extremely large.
>
> Namely:
>
> ...
> 166M    /x1/www/incubator.apache.org/uima
I checked into this and discovered that > 85% of the space is due to our
keeping various kinds of documentation for our releases on our website,
including ~ 40 MB for the Javadoc API documentation of the "current"
release.

We also keep documentation here (but not the API docs) for 2 past
releases; these take ~ 40 MB.

We ended up keeping our documentation in SVN and checking it out onto
the website, after a long discussion of pros/cons, ending with this in a
note from Robert Burrell Donkin, concerning where to keep the
documentation and Javadocs, in which he said:

... <snip>
i talked it over with the infra team and their strong recommendation
was to store in svn and then checkout onto the website
... <snip>

You can see the whole email thread here:
http://www.mail-archive.com/uima-dev@.../msg05150.html

Based on this, I would like to keep things as they are, unless there is
a new conclusion about where things like documentation should go.

-Marshall Schor



Re: Size of websites in incubator.apache.org

by Justin Erenkrantz

On Tue, Apr 22, 2008 at 10:56 AM, Marshall Schor <msa@...> wrote:
>  Based on this, I would like to keep things as they are, unless there is a
> new conclusion about where things like documentation should go.

Nah - that's fine.  The issue is the TLPs that have graduated and left
a bunch of stuff in their incubator dirs.  -- justin



Re: Size of websites in incubator.apache.org

by Tony Stevenson-2

Justin

Is the preferred method of handling these to move them to archive.a.o/incubator.a.o/($podling)/? Or is there an alternate location?


Tony


Re: Size of websites in incubator.apache.org

by Justin Erenkrantz

On Tue, Apr 22, 2008 at 12:53 PM, Tony Stevenson <tony@...> wrote:
> Justin
>
>  Is the preferred method of handling these to move them to archive.a.o/incubator.a.o/($podling)/? Or is there an alternate location?

If the projects have indeed graduated, and they already have
$podling.apache.org up, and no one responds to clean them up within,
say, a week, I'd toss 'em entirely and enforce a redirect from the old
incubator.apache.org URL to the new <tlp>.apache.org site.

If you feel charitable and want to save the artifacts on the backup
box somewhere (since they're already copied over) for a little while
longer, feel free...but, IMO, we don't need to persist these sites.
-- justin
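
As a rough illustration of the redirect approach described above, the sketch below generates one redirect line per graduated project. The thread does not show the directives actually used on the site, so the mod_alias "Redirect permanent" form is an assumption, as are the names GRADUATED and redirect_lines. The mapping uses the projects reported as graduated earlier in the thread; cxf is left out because it was described as still graduating.

```python
# Hypothetical mapping: podling directory under incubator.apache.org -> new TLP host.
# Entries come from the replies in this thread; cxf is omitted (still graduating).
GRADUATED = {
    "activemq": "activemq.apache.org",
    "directory": "directory.apache.org",
    "openjpa": "openjpa.apache.org",
    "servicemix": "servicemix.apache.org",
}

def redirect_lines(projects):
    """Build one mod_alias Redirect line per graduated podling, sorted by name.

    mod_alias appends any path after the matched prefix to the target URL,
    so the target is written without a trailing slash.
    """
    return [
        f"Redirect permanent /{podling} http://{host}"
        for podling, host in sorted(projects.items())
    ]

if __name__ == "__main__":
    print("\n".join(redirect_lines(GRADUATED)))
```

The output is meant to be reviewed by hand before anything is pasted into a live configuration.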



Re: Size of websites in incubator.apache.org

by robert burrell donkin-2

On Tue, Apr 22, 2008 at 11:02 PM, Justin Erenkrantz
<justin@...> wrote:
> On Tue, Apr 22, 2008 at 12:53 PM, Tony Stevenson <tony@...> wrote:
>  > Justin
>  >
>  >  Is the preferred method of handling these to move them to archive.a.o/incubator.a.o/($podling)/? Or is there an alternate location?

probably not

>  If the projects have indeed graduated, and they already have
>  $podling.apache.org up, and no one responds to clean them up within,
>  say, a week, I'd toss 'em entirely and enforce a redirect from the old
>  incubator.apache.org URL to the new <tlp>.apache.org site.

+1

>  If you feel charitable and want to save the artifacts on the backup
>  box somewhere (since they're already copied over) for a little while
>  longer, feel free...but, IMO, we don't need to persist these sites.

+1

- robert



Re: Size of websites in incubator.apache.org

by Craig L Russell

With 587 MB, OpenJPA wins and is still champion. ;-)

It's ok to redirect the incubator site to openjpa.apache.org.

Craig

Craig Russell
Architect, Sun Java Enterprise System http://java.sun.com/products/jdo
408 276-5638 mailto:Craig.Russell@...
P.S. A good JDO? O, Gasp!




Re: Size of websites in incubator.apache.org

by Craig L Russell

In case it wasn't clear, the incubator/openjpa web site already  
redirects to the live site.

There's no need to preserve it. It's obsolete.

Craig

Craig Russell
Architect, Sun Java Enterprise System http://java.sun.com/products/jdo
408 276-5638 mailto:Craig.Russell@...
P.S. A good JDO? O, Gasp!




Re: Size of websites in incubator.apache.org

by dkulp


I was adding cxf to the site-publish/.htaccess file and was going to go
ahead and add the others that have graduated, but I want to double check
something first.

Several of the graduated projects have created their own .htaccess in
their project directory rather than use the top level .htaccess.
Example: servicemix/.htaccess

The question is: is it better to leave it like that or move them to the
top level .htaccess and completely remove the project directory?  I
don't really care which, but consistency is probably good and whichever
way we go, it should be documented in the post graduation checklist
stuff.

Dan







--
J. Daniel Kulp
Principal Engineer, IONA
dkulp@...
http://www.dankulp.com/blog



Re: Size of websites in incubator.apache.org

by robert burrell donkin-2

On Wed, Apr 23, 2008 at 7:20 PM, Daniel Kulp <dkulp@...> wrote:

>
>  I was adding cxf to the site-publish/.htaccess file and was going to go
>  ahead and add the others that have graduated, but I want to double check
>  something first.
>
>  Several of the graduated projects have created their own .htaccess in
>  their project directory rather than use the top level .htaccess.
>  Example: servicemix/.htaccess
>
>  The question is: is it better to leave it like that or move them to the
>  top level .htaccess and completely remove the project directory?  I
>  don't really care which, but consistency is probably good and whichever
>  way we go, it should be documented in the post graduation checklist
>  stuff.

AIUI using the top level .htaccess is better for performance so that's
what i recommend (but hopefully someone will jump in and correct me if
i'm wrong)

- robert



Re: Size of websites in incubator.apache.org

by Justin Erenkrantz

On Wed, Apr 23, 2008 at 11:26 AM, Robert Burrell Donkin
<robertburrelldonkin@...> wrote:
>  AIUI using the top level .htaccess is better for performance so that's
>  what i recommend (but hopefully someone will jump in and correct me if

+1.

(Not having .htaccess at all is actually best; but that requires us
tweaking the master httpd conf files whenever a PMC wants a redirect -
doable but feh.)  -- justin



Re: Size of websites in incubator.apache.org

by dkulp


OK.  I've gone ahead and updated the graduation guide to point to the top
level .htaccess file.  (I hope no one minds someone not on the incubator
PMC updating that.)

I also added several of the projects to the .htaccess file.  Basically,
the graduated projects that used a simple .htaccess or a meta-refresh
(yes, a couple are doing that, ick) I've put in the .htaccess.  Thus,
their directories could be removed.  

However, a couple projects are using an .htaccess that is much more
complex than a simple "one liner" so I left them as is.  (example:
ftpserver)
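
The split described above, between a simple "one liner" .htaccess and a more complex one, could be approximated with a check like the following. This is only a sketch: the criteria actually applied are not spelled out in this thread, and is_simple_redirect is a made-up helper. It treats a file as simple when, ignoring comments and blank lines, it holds exactly one mod_alias redirect directive.

```python
# Directive names are matched case-insensitively, as Apache does.
REDIRECT_DIRECTIVES = {"redirect", "redirectmatch", "redirectpermanent", "redirecttemp"}

def is_simple_redirect(htaccess_text):
    """Return True if the text contains exactly one directive and it is a redirect."""
    directives = [
        line.strip()
        for line in htaccess_text.splitlines()
        if line.strip() and not line.strip().startswith("#")
    ]
    if len(directives) != 1:
        return False
    first_word = directives[0].split()[0].lower()
    return first_word in REDIRECT_DIRECTIVES
```

Files that fail this check (for example ones using rewrite rules) would be left in their project directory and reviewed by hand.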

Dan





--
J. Daniel Kulp
Principal Engineer, IONA
dkulp@...
http://www.dankulp.com/blog



Re: Size of websites in incubator.apache.org

by gnodet

On Tue, Apr 22, 2008 at 6:22 PM, Tony Stevenson <tony@...> wrote:

> Good day,
>
>  As part of rolling out the new backup server for the infra team, I have
> discovered that several podling sites are extremely large.
>
>  Namely:
>
>  119M    /x1/www/incubator.apache.org/activemq
>  324M    /x1/www/incubator.apache.org/cxf
>  102M    /x1/www/incubator.apache.org/directory
>  166M    /x1/www/incubator.apache.org/lucene.net
>  587M    /x1/www/incubator.apache.org/openjpa
>  299M    /x1/www/incubator.apache.org/servicemix

incubator.apache.org/servicemix is already redirecting to servicemix.apache.org
I'll remove the content of the directory asap.




--
Cheers,
Guillaume Nodet
------------------------
Blog: http://gnodet.blogspot.com/



Re: Size of websites in incubator.apache.org

by Emmanuel Lecharny

Tony Stevenson wrote:
> Good day,
Hi

> 102M    /x1/www/incubator.apache.org/directory

we exited the incubator 3 years ago ... This directory can be
archived or removed at will.

Thanks !

--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org





Re: Size of websites in incubator.apache.org

by Erik Hatcher

>>  166M    /x1/www/incubator.apache.org/lucene.net

i've just removed some (large) old docs.

        Erik

